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^\ ^ Abstract 



We present a system for the investigation of computational properties of categorial gram- 
mar parsing based on a labelled analytic tableaux theorem prover. This proof method allows 
' us to take a modular approach, in which the basic grammar can be kept constant, while a range 

, of categorial calculi can be captured by assigning different properties to the labelling algebra. 

. The theorem proving strategy is particularly well suited to the treatment of categorial gram- 

mar, because it allows us to distribute the computational cost between the algorithm which 
deals with the grammatical types and the algebraic checker which constrains the derivation. 



1 Background 

A current trend in logic is to attempt to incorporate semantic information into the domain of 
deduction, Q. An area for which this strategy is particularly useful is the problem of 

categorial grammar parsing. The categorial grammar research programme requires the use of 
■ a range of logical calculi for linguistic description. Some researchers have considered labelled 

deduction as a tool for implementing categorial parsers [0 , jl^ , and this paper can be seen as a 
new contribution to this field. 

In this paper we aim for a modular approach, in which the basic grammar is kept constant, 
while different calculi can be implemented and experimented with by constraining the derivations 
produced by the theorem prover. At present, our system covers the classical Lambek Calculus, L, 
as well as the non-associative Lambek calculus NL, and variants such as Van Benthem's |^ 
LP, LPC, LPE and LPCE, and their non-associative counterparts. The system is based on labelled 
analytic deduction, particularly on the LKE method, developed by D'Agostino and Gabbay 0|. 
LKE is similar to a Smullyan-style tableau system, in which the derivations obey the sub-formula 
principle, but it improves on efficiency by restricting the number of branching rules to just one. 
Different categorial logics are handled by assigning different properties to the labelling algebra, 
while the basic syntactic apparatus remains the same. This allows the user to experiment with 
various linguistic properties without having in principle to modify the grammar itself. 



The basic structure of the paper is as follows. In section 1.1, we introduce the family of 
categorial calculi, and discuss some of the linguistic arguments which have been put forward in 
the literature with regard to these calculi. In section we introduce the logical apparatus on 
which the system is based, describe the algorithm and prove some of the properties mentioned 

* Supported by CNPq Brazilian Research Council Research Studentship No. 200210/93-9 
t Supported by ESRC Research Studentship No. R00429334338 
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in section 1.1 within this framework. We also show how different grammars can be characterised 



and present a worked example. In section 2.3 the system is compared with other strategies for 
dealing with multiple categorial logics, such as hybrid formalisms and unification-based Gentzen- 
style deduction. In this section, we also suggest some ways to improve the efficiency of the system, 
and strategies for dealing with the complexity of labelled unification. 

1.1 A Family of Categorial Calculi and Their Linguistic Applications 

Categorial Grammars can be formalised in terms of a hierarchy of well understood and mathemat- 
ically transparent logics, which yield as theorems a range of combinatorial operations. However 
the precise nature of the combinatorial power required for an adequate characterisation of natural 
language is still very much a matter of debate. For this reason, it is desirable to have a means of 
systematically testing the linguistic consequences of adopting various calculi. In this section we 
give an overview of the linguistic applications of some of the calculi in the hierarchy, with a view 
towards motivating the usefulness of a generic categorial theorem prover as a tool for linguistic 
study. 

The combinatorial possibilities of expressions in general can be characterised in terms of re- 
duction laws. In R1-R6 below, we give some reduction laws discussed in [|l6|] , which have been 
found to be linguistically useful.]^ 

Rl: Application R2: Composition R3: Associativity 

X/Y, Y h X X/Y, Y/Z h X/Z (Z\X)/Y h Z\(X/Y) 

Y, Y\XI-X Z\Y, Y\XI-Z\X Z\(X/Y) h (Z\X)/Y 

R4: Lifting R5: Division (main functor) R6: Division (sub. functor) 

X h Y/(X\Y) X/Y h (X/Z)/(Y/Z) X/Y h (Z/X)\(Z/Y) 

X h (Y/X)\Y Y\X h (Z\Y)\(Z\X) Y\X h (Y\Z)/(X\Z) 

It is possible to define a hierarchy of logical calculi, each of which admits one or more of R1-R6 
as theorems; from the purely applicative calculus AB, of Ajdukiewicz and Bar-Hillcl, ||^, which 
supports only to the full Lambek calculus L, which supports all the above laws. Calculi inter- 
mediate in power between AB and L have been explored (e.g. Dependency Categorial Grammar 
pO|), as well as stronger calculi which extend the power of L through the addition of structural 
rules. 

Much of the interest in using categorial grammars for linguistic research derives from the 
possibilities they offer for characterizing a flexible notion of constituency. This has been found 
particularly useful in the development of theories of coordination, and incremental interpretation. 
For example, assuming standard lexical type assignments, the following right node raised sentence 
cannot be derived in AB, but does receive a derivation in a system which includes R3, with each 
conjunct assigned the type indicated. 

(1) [John resents s/np] ^^J^d [Peter envies s/np] Mary 

A calculus which includes composition, R2, will allow a function to apply to an unsaturated 
argument, and it is this property which allows Ades and Steedman [|l| to treat long distance 
dependencies, and motivates much of Steedman's later work on incremental interpretation. 

Dowty [^ uses the combination of composition, R2 and lifting, R4, to derive examples of 
non-constituent coordination such as John gave mary a book and Susan a record. 

We can increase the power of L by adding the structural transformations Permutation, Contrac- 
tion and Expansion, to derive the calculi LP, LPC, LPE and LPCE. The structural transformation 
Permutation, which removes the restrictions on the linear order of types, allows us to go beyond 
the purely concatenative derivations of L. This allows us to deal with sentences exhibiting non- 
standard constituent order. For example, Moortgat suggests using permutation for dealing with 
heavy NP-shift in examples similar to the following : 

^We adopt the Lambek notation, in which X/Y is a function which "takes" a Y to its right to yield an X, and 
Y\X is a function which "takes" a Y to its left to yield an X. 
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(2) John gave [to his nephew pp] [aU the old comic books which he'd coUected in his troubled 

adolescence np\- 



In (|^), the bracketed constituents can be "rearranged" via permutation so that a derivation is 
possible that employs the standard type ((NP\ S)/PP)/NP for the ditransitive verb gave. 

In L, while it is possible to specify a type missing an argument on its left or right periphery, 
it is not possible to specify a type missing an argument "somewhere in the middle" , making it 
impossible to deal with non-peripheral extraction. However, as Morrill et al show, permutation 
provides the additional power necessary to account for this phenomenon Q . 

In addition to permutation, there are also linguistic examples which motivate contraction (e.g. 
gapping, and expansion (e.g. right dislocation, @). However it is universally recognized 
that a system employing the unrestricted use of structural transformations would be far too 
powerful for any useful linguistic application, since it would allow arbitrary word order variation, 
copying and deletion. For this reason, a goal of current research is to build a system in which 
the resource freedom of the more powerful calculi can be exploited when required, while the basic 
resource sensitivity of L is retained in the general case. One such approach is to employ structural 
modalities fl^ , which are operators that explicitly mark those types which are permitted to be 
manipulated by specific structural transformations 

2 A framework for Categorial Deduction 

In this section we describe the theorem proving framework for categorial deduction. We start by 
setting up basic ideas of categorial logic, giving formal definitions of the core logical language. 
Then we move on to the theorem proving strategy, introducing the LKE approach |Q and the 
algebraic apparatus used to characterise different calculi. 

2.1 The core syntax 

We assume that there is a finite set of atomic grammatical categories which will be represented by 
special symbols: NP for noun phrases, S for sentences, etc. So, the set of well-formed categories 
can be defined as below. 

Definition 1 The set of well-formed categories, C is the smallest set which contains every basic 
category and which is closed under the following rule: 

(i) If X eC and Y eC , then X/Y, X\Y and X • YeC 

Our purpose in this section is to define a prcedure which will enable us to verify, given an 
entailment relation h , whether or not such a relation holds for the logic being considered. 

Many proof procedures for classical logic have been proposed: natural deduction, Gentzen's 
sequents, analytic (SmuUyan style) tableaux, etc. Among these, methods which conform to the 
sub-formula principle are particularly interesting, as far as automation is concerned. See for 
a survey. Most of these methods, along with proof methods developed for resource logics, such as 
Girard's proof nets (a variant of Bibel's connection method), can be used for categorial logic. Leslie 
presents and compares some categorial versions of these procedures for the standard Lambek 
calculus L, taking into account complexity and proof presentation issues. Although tableau systems 
are not discussed in [Q, a close relative, the cut-free sequent calculus is presented as being the 
one which represents the best compromise between implementability and display of the proof. 

SmuUyan style tableau systems, however, have been shown to be inherently inefficient They 
cannot even simulate truth-tables in polynomial time. The main reason for this is the fact that 
many of the SmuUyan tableau expansion rules cause the proof tree to branch, thus increasing the 
complexity of the search. Moreover, keeping track of the structure of the derivations represents 

^Systems which allow the selective use of structural transformations may be implemented in the general frame- 
work presented here, although we do not address this issue. 
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an extra source of complexity, which in most categorial parsers 17, |19| is reflected in expensive 
unification algorithms employed for dealing with substructural implication. In order to cope 
with efficiency and generality, we have chosen the LKE system Q as the proof theoretic basis 
of our approach^. LKE is an analytic (its derivations exhibit the sub-formula property) method 
of proof by refutation^ which has only one branching rule. In addition, its formulae are labelled 
according to a labelling algebra which will determine the closure conditions for the proof trees^. 
In what follows, we shall concentrate on explaining our version of the system, the heuristics that 
we have found useful for dealing with particularities of the calculi covered, and the relevant results 
for these calculi. The usual completeness and soundness results (with respect to the algebraic 
semantic provided) are already given in so we will not discuss them here. 

We have mentioned that the condition for a branch to be considered closed in a standard 
tableau is that both a formula and its negation occur on it. The calculus defined above presents 
no negation, though. So, we have to appeal to some extrinsic mechanism to express contradiction. 
In SmuUyan's original formulation, the formulae occurring in a derivation were all preceded by 
signs: T or F. For instance, assume that we want to prove A A in classical logic. We start by 
saying that the formula is false, prefixing it by F, and try to find a refutation for F A ^ A. For this 
to be the case both T A (the antecedent) and F A (the consequent) have to be the case, yielding a 
contradiction. In classical logic we can interpret T and F as assertion and denial respectively, and 
so we can incorporate F into the language as negation, obtaining uniform notation by eliminating 
the need for signed formulae. In our approach, since negation is not defined in the language, we 
shall make use of signed formulae as proof theoretic devices. T and F will be used to indicate 
whether or not a certain string available for combination to produce a new one. 

2.2 The generalised parsing strategy 

If we had restricted the system to dealing with signed formulae, we would have a proof procedure for 
an implicational fragment of standard prepositional logic enriched with backwards implication and 
conjunction. However, we have seen that the Lambck calculus does not exhibit any of the structural 
properties of standard logic, and that different calculi may be obtained by varying structural 
transformations. Therefore, we need a mechanism for keeping track of the structure of our proofs. 
This mechanism is provided by labelling each formula in the derivation with information token^. 

Labels will act not only as mechanisms for encoding the structure of the proof, from a proof- 
theoretic perspective, but will also serve as means to propagate semantic information through the 
derivation. A label can be seen as an information token supporting the information conveyed by 
the signalled formula it labels. Tokens may convey different degrees of informativeness, so we 
shall assume that they are ordered by an anti-symmetric, reflexive and transitive relation, □ , so 
that an expression like x C y asserts that y is at least as informative as x (i.e. verifies at least 
as many sentences as x). We also assume that this semantic relation, "verifies", is closed under 
deductibility. 

It is natural to suppose that, as well as categories, information tokens can be composed. We 
have seen that a type S/NP can combine with a type NP to produce an S. If we assume that there 
are tokens x and y verifying respectively S/NP and NP, how would we represent the token that 
verifies S? Firstly, we define a token composition operation o . Then, we assume that, a priori, 
the order in which the categories appear in the string matters. So, a minimal information token 
verifying S would be x o y. As we shall see below, the constraints we impose on o will ultimately 
determine which inferences will be valid. For instance, if we assume that the order in which the 
types occur is not relevant, then we may allow permutation on the operands, so that x o y C y 

^For standard propositional logic, it has been shown that LKE can simulate standard tableau in polynomial 
time, but the converse is not true. 
*A formula i s p' 



^See section 2.3 



oved by building a counter-model for its negation. 

for a discussion of how LKE, unlike proof nets or standard tableaux, enables us to reduce the 
computaiicmal cost of label unification. 

®See [hl| for a proof theoretic motivated, LDS approach, and |fe|| for an approach based on a finer-grained, 
semantically motivated information structure. 
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o x; if we assume that contraction is a structural property of the calculus then the string [S /NP, 
NP, NP] will also yield an S, since y o y □ y, etc. Let's formalise these notions by defining an 
algebraic structure, called Information frame. 

Definition 2 An Information Frame is a structures C ~ {P ,o where (i) V is a non-empty 

set of information tokens; (ii) V is a complete lattice under CI ; (Hi) o is an order-preserving, 
binary operation onV which satisfies continuity, i.e., /or every directed family {zi}, \_\ {zi o a;} = 
y {zi} o X and |J {a; o Zi\ — x o \_\{zi}; and (iv) 1 is an identity element in V . 

Combinations of types are accounted for in the labelling algebra by the composition opera- 
tor. Now, we need to define an algebraic counterpart for syntactic composition, • , itself. When 
a formula like S/NP • NP is verified by a token x, this is because its components were avail- 
able for combination, and consequently were verified by some other tokens. Now, suppose S/NP 
was verified by a token, say a. What would be the appropriate token for NP, such that S/NP 
combined with NP would be verified by x? It certainly would not be more informative than x. 
Moreover, if the expression S /NP • NP were to stand for the composition of the (informational) 
meanings of its components, then the label for NP would have to verify, when combined with 
a, at most as much information as x. In order to express this, we define the label for NP as 
being the greatest y s.t. x is at least as informative as a combined with y. This token will be 
represented by x ^/a. In general, x y^y |J {z| y o z C x}. An analogous operation, can 
be defined to cope with cases in which it is necessary to find the appropriate label for the first 
operand by reversing the order of the tokens in the definition above. Some properties of 

yo {x^y) n X (3) 1 Q x^x (4) 

(a; / y) o z C {x o z) y y (5) {x / y) ^ z x / {y o z) (6) 

Having set the basic elements of our proof-theoretic apparatus, we are now able to define the 
components of a derivation as follows: 

Definition 3 Signed labelled formulae (SLF) are expressions of the form S Cat : L, where S 
G {T,F}, Cat gC and L € C 

A derivation, or proof will be a tree structure built according to certain syntactic rules. These 
rules will be called expansion rules, since their application will invariably expand the tree structure. 
There are three sorts of expansion rules: those which expand the tree by generating two formulae 
from a single one occurring previously in the derivation, those which expand the tree by combining 
two formulae into a third one which is then added to the tree, and the branching rule. The first kind 
of rule corresponds to what is called a-rule in Smullyan tableaux; these rules will be called a-rules 
here as well. The second and third kinds have no equivalents in standard tableau systems. We 
shall refer to the second kind as cr-rules, and to the branching rule as /3-rule - after SmuUyan's, 
even though his branching rules are different. Figure U summarises the expansion rules to be 
employed by the system. A deduction bar says that if the formula(e) appearing above it occurs 
in the tree, then the formula(e) below it should be added to the tableau. The rules are easily 
interpreted according to the intuitions assigned above to signs, formulae and information tokens. 
A rule like Q:(i), for example, says that if A\B is not available for combination and x verifies such 
information, then this is because there is an A available at some token a, but the combination 
of a and x (notice that the order is relevant) does not produce B. Given the expansion rules, 
the definition of the main data structure to be manipulated by the theorem proving (parsing) 
algorithm is straightforward: a derivation tree, T, is simply a binary tree built from a set of given 
formulae by applying the rules. The next step is to define the conditions for a tree to be regarded 
as complete. Completion along with inconsistency are the notions upon which the algorithm's 
termination depends. It can be readily seen on Figure |I| that for a finite set of formulae, the 
number of times a and a rules can be applied increasing the number of SLFs (nodes) in T is 
finite. Unbounded application of /3, however, might expand the tree indefinitely. In order to assure 
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a-rules 


(i) 


(ii) 


(iii) 


/3-rule 


(ai) 


F A\B : X 


F A/B : X 


T A • B : X 








(aa) 


T A : a* 


T B : a* 


T A : a* 


TA : X 1 (/^a) FA : x 


(as) 


F B : a X 


F A : X a 


T B : X /a 








a-rules 


(i) 


(ii) 


(iii) 


(iv) 


(v) 


(vi) 


(^i) 


T A\B : X 


T A\B : X 


T A/B : X 


T A/B : X 


F A • B : X 


F A • B : X 




T A:y 


F B : y X 


F A: X y 


TB :y 


T A : y 


T B ; y 


(^3) 


TB : y X 


FA:y 


FB : y 


T A : X y 


F B : X /y 


F A : X /y 



*:a new label a (not occurring previously in the derivation) must be introduced. 



Figure 1: Tableau expansion rules 



termination, applications of /? will be restricted to sub-formulae of formulae in T. These notions 
are formalised in Definition y. 

Definition 4 (Tree Completion) Given T , a tree for a set of SLFs S, we say that a binary tree T* 
is a tableau for SifT* results from T through the application of an expansion rule. A tableau T * 
is linearly complete if it satisfies the following conditions: (i) if ai G T , then a2 and € T; (ii) 
if (Ti and oi G T , then 173 S T. A tree T* is complete iff for every A gT* and every sub-formula 
A' of A, both F A' : X and T A' : x have been added to T * by an application of the [3-rule. 

Now, the first step towards building a counter-model for the denial of the formula to be proved is 
the search for a tree containing potential contradictions. Whether or not a potentially inconsistent 
tree is a counter-model for the formula will depend ultimately upon the constraints on the labelling 
algebra. This form of inconsistency is defined below. 

Definition 5 (Branch and Tree Inconsistency) A branch is inconsistent iff for some type X both 
T X and F X, labelled by any information token, occur in the branch. A tree is inconsistent iff its 
branches are all inconsistent. 

Given the definitions above, we are ready to define an algorithm for expanding linearly the deriva- 
tion tree. For efficiency reasons non-branching rules will be exhaustively applied before we move 
on to employing /3-rules. Definition |^ presents the basic procedure for generating linear expansion 
for a branch.]] The complete LKE algorithm. Definition H, which uses the procedure below, will 
be presented after we have discussed tableau closure from the information frame perspective. 

Definition 6 (Algorithm: Linear Completion) Given T , a LKE-tableau structure, we define the 
procedure: 

Li near-Completion (T) 

1 do T <;= Q;-completion(T) 

2 formula <^= head[T] 

3 while ( -icompleted(T) or consistent(T) ) 

4 do if gi -type (formula) 

5 then do formulaa^a; <= search(T,(T2) > formulaaua: is a set of cr2-type slf s 

6 if formulaano; 7^ ^ as-set results from combining cti to each (72 

7 then do era-set combine-labels((Ti,formulaa„a;) 

8 (T3-expansion a-completion(CT3-set) 

9 T<= append(T,(T3-expansion) 

10 do formula ncxt[T] 

11 return T 

'^The reader will notice that if we had allowed a2 formulae to search for ci types for combination, in the same 
way that cris search for o'2S, then linear expansion would not terminate for some cases. Consider for example the 
infinite sequence of cr applications: TA/B:x, TB/A:y, TA:z, TB:yoz, TA;xo(yoz), TB:yo(x 
o (y o z)),... A strategy to allow unrestricted cr-application without running into non-terminating procedures, as 
well as other practical and computational issues is discussed in ||l5|. 
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We have seen above that the labels are means to propagate information about the formulae 
through the derivation tree. From a semantic viewpoint, the calculi addressed in this paper are 
obtained by varying the structure assigned to the set of formulae in the derivatiori|^. Therefore, in 
order to verify whether a branch is closed for a calculus one has to verify whether the information 
frame satisfies the constraints which characterise the calculus. For instance, the standard Lambek 
calculus L does not allow any sort of structural manipulation of formulae apart from associativity; 
LP allows formulae to be permuted; LPE allows permutations and expansion (i.e. if B can be 
proved from the sequent A, A, F, then B can be proved from A, A, A, F); LPC allows permutation 
and contraction; etc. The definition below sets the algebraic counterparts of these properties. 

Definition 7 An information frame is: (i) associative if x o (y o z) 'O (x o y) o z and (x o y) o z 
^ X o (y o z); (ii) commutative if x o y \Z y o x; (Hi) contractive if x o x x; (iv) expansive if x 
^ X o x; (v) monotonic if x \— x o y, for all x, y, z . 

Now, we say that a branch is closed with respect to the labelling algebra if it contains SLFs 
of the form T X : x and F X : y, where x C y. Likewise, a tree is closed if it contains only 
closed branches. Checking for label closure will depend on the calculus being used, and consists 
basically of reducing information token expressions to a normal form, via properties (|^)-(^), and 
then matching tokens and/or variables that might have been introduced by applications of the 
/3-rule according to the properties or combination of properties [Definition^ that characterise the 
calculus considered. It should be noticed that, in addition to the basic algorithm, heuristics might 
be employed to account for specific linguistic aspects. Some examples: (a) it could be assumed 
that all the bracketing for the strings is to the right thus favouring an incremental approach; (b) 
type reuse could be blocked at the level of the formulae, reducing the the computational cost of 
searches for label closure, since most of the calculi in the family covered by the system are resource 
sensitive; (c) priority could be given to juxtaposed strings for cr-rule application, etc. Definition |^ 
gives the general procedure for tableau expansion, abstracted from the heuristics mentioned above. 

Definition 8 (Algorithm: LKE- completion) The complete tableau expansion for a LKE-tree Tis 

given by the following procedure: 

expansion(T) 

1 do closure-flag <^ no 

2 while ^( completed(T) or closed- T= yes) 



3 do linear-completion(T) 

4 jf -iconsistent(T) and label-closure(T) 

5 then do closure-flag <^ yes 

6 else do subformula <^ select-subformula(T) 

7 subformula^ <^ assign-label-T(subformula) 

8 subformula^ <S= assign- label-F (subformula) 

9 Ti <^ append(T,{subformulaT}) 

10 T2 <^= append(T,{subformulap'}) 

11 ]f ( expansion(Ti) — yes and expansion(T2) = yes ) 

12 then do closure-flag <^ yes 



13 return closure-flag 

As it is, the algorithm defined above constitutes a semi-decision procedure. This is due to the 
fact that even though the search space for signed formulae is finite, the search space for the labels 
is infinite. The labels introduced via /3-rules are in fact universally quantified variables which must 
be instantiated during the label unification step. This represents no problem if we are dealing with 
theorems, i.e. trees which actually close. However, for completed trees with an open branch, the 
task might not terminate. In order to overcome this problem and bind the unification procedure 
we restrict label (variable) substitutions to the set of tokens occurring in the derivation — similarly 

*For instance, resource sensitive logics such as linear logic arc frequently characterised in terms of multisets to 
keep track of the "use" of formulae throughout the derivation. 
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to the way parameter instantiation is dealt with by hberahzed quantification rules for first-order 
logic tableaux. In practice, the strategy adopted to reduce label complexity also employs the 
following refinements: (i) the tableau is linearly expanded keeping track of the choices made when 
(T-rules are applied (the options are kept in a stack); (ii) once this first step is finished, if the 
tableau is still open, then backtrack is performed until either the choices left over are exhausted 
or closure is achieved; (iii) only then is the /3-rule applied. This explains the role played by the 
heuristics mentioned abov e We are now able to establish some results regarding the reduction 
laws mentioned in section 1.1. 

Proposition 1 (Reduction Laws) Let X, Y and Z be types, and C an information frame. The 
properties R1-R5 hold: 

Proof 1 The proofs are obtained by straightforward application of Definition ^ and Definition ^. 
Below we illustrate the method by proving (Rl) and (R2): 

(Rl) To prove right application we start by assuming that it is verified by the identity token 1. 
From this we have: 1- T X/Y • Y : m, 2- F X : 1 o m = m. Then, we apply oli^h^-^ to 1 
obtaining 3- T X/Y : n and 4- T Y : m ^n. The next step is to combine 3 and 4 via 
getting 5- T X : n o (m ^ n). Now we have a potential closure caused by 5 and 2. If we 
apply property ^) to the label for 5 we find that no (m y n) □ m, which satisfies the closure 
condition thus closing the tableau. 

(R2) Let's prove left composition. As we did above, we start with: 1- T Z\Y • Y\X : m and 2- F 
Z\X : 1 o m. Applying a^m) to 1 we get: 3- T Z\Y : a and 4- T Y\X : m y a. Now, we 
may apply a^i) to 2 and get: 5- T Z : b and 6- F X : b o m. Then, combining 3 and 5 via 
cr(i): 7- T Y : b o a. And finally 4 and 7 through the same rule: 8- T X : (b o a) o (m y a). 
The closure condition for 8 and 6 is achieved as follows: 

{bo a) o (m y a) \— b o {a o [m y a)) by associativity 

\— b o m by and o being order-preserving _ 

Even though L does not enjoy finite axiomatizability, the results above suggest that the calculus 
finds a natural characterization in LKE for associative information frames. In particular, the 
Division Rule {R6) can be regarded as L's characteristic theorem, since it is not derivable in weaker 
calculi such as AB, NL, and F. If we do not allow associative frames, we get NL. Stronger calculi 
such as LP, LPE, LPC and LPCE can be obtained for the same general framework by assigning 
further properties to o in the labelling algebra. Frames exhibiting combinations of monotonicity, 
expansivity, commutativity and contraction allow us to characterise these substructural calculi. 
Algebras that are both associative and commutative describe LP. Adding expansivity (weakening) 
to LP results in LPE. Associativity, commutativity and contraction describe LPC frames. LPCE is 
obtained by combining the properties of LPC and LPE algebras. 

We end this section with a simple example requiring associativity: show, in L, that an NP 
(John), combined with a type (NP\S)/NP (likes) yields S/NP, i.e a type which combined with a 
NP will result in a sentence (Proof ||). 

Proof 2 Let's assume the following type-string correspondence: NP for John, (NP\S)/NP for 
likes. The expression we want to find a counter-model for is: 1- F NP • (NP\S)/NP S/NP. 
Therefore, the following has to be proved: 2- T NP • (NP\S)/NP : m and 3- F S/NP : m. We 
proceed by breaking 2 and 3 down via on^ai), obtaining: 4- T NP : a, 5- T (NP\S)/NP : my a, 6- 
T NP : b, and 7- F S : (m o b). 

Now we start applying a-rules (annotated on the right-hand side of each line): 

8- T NP\S :(mya)ob 5,6 a 

9- T S : a o ((m y a) o h) 4^9 a(^i) 

We have derived a potential inconsistency between 7 and 9. Turning our attention to the infor- 
mation tokens, we verify closure for L as follows: 

^Furthermore, as we will show in section |2.3| , if associativity is allowed at the syntactic level then it is possible 
to eliminate the branching rule for the class of calculi discussed here. 
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a o ((m y a) o h) C (a o (m y a)) o h by associativity 
□ m o a by property ^) 



2.3 Comparison with Existing Approaches 

Early implementations of C G p arsing relied on cut-free Gentzen sequents implemented via back- 
ward chaining mechanisms ||l6|] . Apart from the fact that it lacks generality, since implementing 
more powerful calculi would involve modifying the code in order to accommodate new structural 
rules, this approach presents several sources of inefficiency. The main ones are: the generate-and- 
test strategy employed to cope with associativity, the non-determinism in the branching rules and 
in rule application itself. The impact of the latter form of non-determinism over efficiency can 
be reduced by testing branches for count invariance prior to their expansion and by performing 
sequent proof normalisation. However, non-determinism due to splitting in the proof structure still 
remains. As we move on to stronger logics and incorporate structural modalities such problems 
tend to get even harder. 

An improved attempt to deal uniformly with multiple calculi is presented in [pT| . In that 
paper, the theorem prover employed is based on proof nets, and the characterisation of different 
calculi is taken care of by labelling the formulae. For substructural calculi stronger than L, much 
of the complexity (perhaps too much) is shifted to the label unification procedures. A strategy 
for improving such procedures by compiling labels into higher-order logic programming clauses is 
presented in for NL and L. However, a comprehensive solution to the problem of binding label 
unification, a problem which arises as we move from sequents to labelled proof nets, has not been 
presented yet. Moreover, as discussed in if we consider that the system is to be used as a 
parser, as a tool for linguistic study, the proof net style of derivation does not provide the clearest 
or most intuitive display of the proofs .P] 

In our approach, the burden of parsing is not so concentrated in label unification but is more 
evenly divided between the theorem prover and the algebraic checker. This is mainly due to the 
fact that the system allows for a controlled degree of non-determinism, present in the cr-rules, which 
enables us to reduce the introduction of variables in the labelling expressions to a minimum. We 
believe this represents an improvement on previous attempts. Besides this, controlling composition 
via bounded backtrack opens the possibility of implementing heuristics reflecting linguistic and 
contextual knowledge. In fact, we verify that, under the appropriate application of rules, we are 
able to eliminate the /3-rule for a class of theorems. 

Proposition 2 (Elimination Theorem) All closed LKE-trees derivable by the application of 
the set of rules TZ = {a(^i),.. .ya^m) ,ai^i^,...,a(^yi) ,f3} can be also derived from TZ — {/?} + { assoc }. 

The proof of this proposition can be done by defining an abstract Gentzen relation, proving 
a substitution lemma with respect to the labelling algebra (as in Q), and showing that our 
consequence relation is closed under the relevant Gentzen conditions even if no P rule is employed. 
The proof appeals to the fact that no formula signed by F can occur in the sequents on the left- 
hand side of the entailment relation, since the calculi presented here do not have negation.]^ We 
believe that this result shows that, even though LKE label unification might be computationally 
expensive for substructural logics in general, the system seems to be well suited for categorial 
logics. We refer the reader to p5[ for a more comprehensive discussion of these issues. 



3 Conclusions and Further Work 

We have described a framework for the study of categorial logics with different degrees of expres- 
sivity on a uniform basis, providing a tool for testing the adequacy of different CGs to a variety 

^"Proof nets and sequent normalisation have also been employed to get around spurious ambiguity (i.e. multiple 
proof for the same sentence, with the same semantics). Our approach does not exhibit this problem. 

^'^Of course, without the f3 rule not all open trees generated will constitute downward saturated sets, since they 
might contain formulae which are not completely analysed. 
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of linguistic phenomena. From a practical point of view, we have investigated the effectiveness 
and generality issues of a parsing strategy for CG opening an avenue for future developments. 
Moreover, we have pointed out some strategies for improving on efficiency and for dealing with 
more expressive languages, including structural modalities. 

The architecture proposed seems promising. Its flexibility with respect to the variety of logics 
it deals with, and its modularity suggest some natural extensions to the present work. Among 
them: implementing a semantic module based on Curry-Howard correspondence between type 
deduction and A-terms, adding local control of structural transformations (structural modalities) 
to the language, increasing expressivity in the information frames for covering calculi weaker than 
L (e.g. Dependency Categorial Grammar ^^), exploiting the derivational structure encoded in 
the labels to define heuristics for models of human attachment preferences etc. Problems for 
further investigation might include: the treatment of polymorphic types (by incorporating rules 
for dealing with quantification analogous to Smullyan's S and 7 rules (1^), and complexity 
issues regarding how the general architecture proposed here would behave under more standard 
theorem proving methods. 
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